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We show that the Template Overlap Method can improve the signal to background ratio of 
boosted H — > bb events produced in association with a leptonically decaying W. We introduce 
several improvements on the previous formulations of the template method. Varying three-particle 
template subcones increases the rejection power against the backgrounds, while sequential template 
generation ensures an efficient coverage in template phase space. We integrate fe-tagging information 
into the template overlap framework and introduce a new template based observable, the template 
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stretch. Our analysis takes into account the contamination from the charm daughters of top decays 
in tt events, and includes nearly-realistic effects of pileup and underlying events. We show that the 
Template Overlap Method displays very low sensitivity to pileup, hence providing a self-contained 
alternative to other methods of pile up subtraction. The developments described in this work are 
quite general, and may apply to other searches for massive boosted objects. 

^ I. INTRODUCTION 

> 

£ — , Recent results from ATLAS [1] and CMS [2j have confirmed the discovery of a new boson of mass roughly 125 GeV, 

decaying to 77, ZZ* and likely VKW / *(for the Tevatron combination of the Higgs searches see Ref. [3])- The results 

£\j are so far consistent with the interpretation of the new particle as the Standard Model (SM) Higgs boson. 

While the data agrees with the SM Higgs boson, our understanding of the new particle's properties remains in- 
complete. A strong test of the SM Higgs theory consists of a detailed experimental study of its sharply predicted 
characteristics. This includes, among others, observing all the major decay modes of the SM Higgs, as well as es- 
tablishing no deviations from the predicted SM Higgs production and decay rates. Global fits to the Higgs boson 
production and decay rates allow for extraction of its couplings to various other SM fields, as well as possible invisible 
channels (see Refs. [IHS] for recent analyses). A SM Higgs boson of 125 GeV predicts a dominant decay mode to a bb 
pair, which calls for a direct verification. An enormous QCD background, however, makes it rather difficult to observe 
this channel at the LHC. The hadronic Higgs decay mode has so far only been reported by the Tevatron experiments, 
at 2.8(7 in the CDF/DO combinations. Despite an impressive progress by both CMS and ATLAS, the extraction of a 
statistically significant measurement of h — > bb rate from the LHC data at 8 TeV remains challenging. 

One way to reduce the QCD background in h — > bb is by focusing on associated Higgs production with W and Z 
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bosons. However, the cross section of the background pp — > Vbb process is still much higher than the cross section of 
pp — > Vh. Even after a cut on the mass of the bb system around the known Higgs mass, the signal is swamped by 
the background. Authors of Ref. [H] showed that when considering moderately boosted Higgs events, traditional jet 
clustering algorithms with large cone sizes (R ~ 1) can be used to increase the signal to background ratio (S/B). The 
method of Ref. |Jj is based on the fact that decay products of a boosted Higgs are collimated and can be captured 
within a single "fat" jet. This, in principle, reduces the combinatorial background, contamination from soft and 
incoherent components and allows to better characterize the structure of the energy flow within the fat jet. 

The last few years have seen a proliferation of new theoretical and experimental techniques to identify high-p^ 
jets at the LHC (see Refs. [TUHT?] for recent reviews and references therein). Two main classes of approaches have 
emerged: Filtering [5] (see also Refs. [JEHU]) and Template Overlap Method [20] • Filtering algorithms act on the list 
of jet constituents by removing the soft components based on some measure which defines the "hard" part of the jet. 
The remaining constituents are then reclustered into the "filtered" jet. The Template Overlap Method, discussed in 
detail below, does not manipulate the measured jet's list of constituents, neither does it require any special clustering 
algorithm. Instead the method compares the jet to a set of parton level states built according to a fixed-order 
distribution of signal jets called templates. The comparison makes use of an "overlap function" which evaluates the 
level of agreement between each measured jet and a set of templates. 

Ref. [30] focused on building the templates according to the leading order decay modes, namely two body (N = 2) 
for the boosted Higgs and three body (N — 3) for the boosted top. In Ref. [ST], the ATLAS collaboration used the 
Template Overlap Method together with the the HEPTopTagger [22] to search for heavy tt resonances. Authors 
of Ref. [33] showed how to extend the Template Overlap Method beyond leading order as well as how to construct 
templates which describe the energy flow of say h — > bbg in an infrared (IR) safe manner. The method allows one to 
gain access to "partonic-like" observables which correspond to the template configurations with the maximal overlap 
score. The resulting information can further improve the ability to distinguish the signal from various background 
channels. 

Furthermore, Ref. [23] introduced the concept of template jet shapes, such as Template Planar Flow |23H26j and 
template-angularity P3[[S7]). The results of Ref. [23] showed that template overlap is capable of delivering background 
rejection factors of 0(100) against the Wfs background in the idealistic ultra-boosted Higgs regime (i.e. pt ~ 1 TeV), 
when combined with other jet substructure observables. 

In this paper, we examine the decay and radiation patterns of a boosted SM Higgs boson, with focus on a realistic 
Pt kinematic regime (i.e, 300 — 400 GeV). We argue that a boosted Higgs search using the Template Overlap Method 
is viable in the future LHC run. We achieve the best signal sensitivity by combining templates in the full phase 
space for N = 3 and N — 2 overlaps in addition to other template based observables. Moreover, we introduce a new 
variable, Template Stretch, which exploits the difference in plain distance of the two leading 6-tagged subjets relative 
to the signal expectations. 

Our treatment of Template Overlap Method improves on the previous formulations in several ways. First, we define 
templates in terms of longitudinally boost-invariant variables. Second, and more importantly, we entirely revamp the 
method of template generation. In Ref. |23j , the minimum number of templates required to adequately describe the 
jet energy flow in the medium p? range, was roughly two orders of magnitude larger than in this paper. The reason is 
that in Ref. [23J , templates were generated in the Higgs rest frame (with a MonteCarlo-like method) and then boosted 
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to the lab frame on an event by event basis. Generating templates in the lab frame and "tiling" them according to 
the event kinematics leads to better coverage of phase space at lower px and to a great improvement in the overall 
performance of the analysis. Third, we introduce 6-tagging into the Template Overlap framework. Information about 
6-jets combined with peak templates serves to improve the rejection power, defined as the signal efficiency divided by 
the efficiency for the background. Finally, the optimal radius of the template subcones is not necessarily the same for 
every parton in a template, as low momentum subjets tend to have wider angular profiles. We allow the three-body 
template subcones to vary with px providing a more adequate description of the showering patterns within a fat jet. 
Varying cones improve the tagging performance of three-body overlap as well as most of the other template-derived 
observables. 

Our analysis includes nearly-realistic effects of pileup and underlying event (UE). In high luminosity environments, 
the large jet cone radius allows for severe effects of pileup on the spectrum of both jet and substructure observables. 
We show that the template jet shapes give us an additional handle on pileup, as they are based on best matched 
templates and not jet constituents. The "spikiness" of the jet energy distribution naturally avoids the complication 
of soft un-correlated backgrounds. The Template Overlap Method is thus less susceptible to pileup compared to 
other kinematic observables such as jet invariant mass and px- This feature invites us to re-consider several jet-shape 
observables in terms of the "partonic" distribution of the peak templates. For instance, jet Planar Flow (Pf ) is known 
to exhibit high susceptibility to pileup [281 |29] , however it is a useful background discriminant when the hard and 
coherent part of the massive jet is considered [2H [UJ [3U] . We thus introduce a pileup- insensitive alternative to Planar 
Flow constructed from the template states alone. 

We use parton-shower simulations to illustrate the non-susceptibility of the Template Overlap Method to a high- 
pileup environment. Our results agree with the 7TeV ATLAS data analysis in Ref. [3T], which showed that the overlap 
method is indeed fairly robust to presence of moderate pileup contamination. 

In Section[TTJ we give an overview of the Template Overlap Method and introduce several template based observables 



sensitive to the QCD radiation patterns. Section III describes our Monte Carlo (MC) data generation, and shows 
the results for boosted Higgs searches with a mass of 125 GeV at y/s — 8 TeV and y/s = 13 TeV. Section |III| also 
contains a detailed discussion of pileup effects. We give a detailed review of template Planar Flow in Appendix \K\ 
while Appendix [B] describes the new method of template states generation. 



II. TEMPLATE OVERLAP METHOD 

Template Overlap Method is based on the quantitative comparison between the energy flow inside physical jets and 
the energy carried by partons modeled after the boosted signal events (templates). We define libraries of templates 
as sets of N four-momenta |/) = \pi,P2," ■ ,Pn) representing the decay products of a SM Higgs boson at a fixed 
momenta P and Higgs mass m^: 

N 

Y,Pa=P, P 2 =™l (1) 
a=l 

We require that the N quanta of energy be captured within an anti-fcy jet of varying size, scaled according to the 
Pt of the Higgs (or according to the px of the associated vector boson). The number of partons in the templates 
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is not necessarily fixed, but is calculated in fixed order perturbation theorjQ and "next-to-leading-order" templates 
with more than the minimum number of partons are possible. Below we focus on combining the information from 
templates in the full phase space for N — 3 and N = 2 partonic configurations, and show that they provide some 
additional rejection power against the dominant Wbb and tt backgrounds. 

Next, a functional measure quantifies the agreement in energy flow between a given Higgs decay hypothesis (a 
template) / and an observed jet j. A scan over a large set of templates that cover the iV-body phase space of a Higgs 
decay results in f\j], the template which maximizes the functional measure. Our primary jet substructure observable 
is the iV-body overlap Ovjy, the value of the functional measure for the best matched template. 

For each jet candidate, we define the (maximum) overlap as 

2" 



Ov N (jJ[j}) = max 



exp 



N i I V 



(2) 



where {/} collectively denotes a template library for the given jet pt, Pr,a is the transverse momentum of the a th 
template parton and px,i is the transverse momentum of the i th jet constituent (or calorimeter tower, topocluster, 
etc.). The first sum is over the N partons in the template and the sum inside the parentheses is over jet constituents. 
The kernel functions F(n, n a ) restrict the angular sums to (nonintersecting) regions surrounding each of the template 
momenta. We refer to the template state which maximizes the functional measure as the "peak template" f[j] of the 
jet j. The parameter e allows us to correct for the energy not captured by the template overlap. In the case of N = 3, 
we use e = 0.8. For more details see Section ["II Bl 

In this analysis, we take the kernel function to be a normalized step function that is nonzero only in definite angular 
regions around the directions of the template momenta p a : 

\ 1 if AR < r a 

F(n,i,n a ) = < , (3) 

I otherwise 

where AR is the plain distance between the template parton and a jet constituent in the (f),(f>) plane. The parameters 
r a determine the angular scale of the template subjet. Together with the energy resolutions er a , these are the only 
tunable parameters of the model. 

A few ideas for possible strategies to determine the values of R a and a a are listed below: 

• Choose the single best parameter according to some optimization criterion (e.g., optimize the tagging efficiency 
and background rejection), and use the same values for all the partons within a set of templates. 

• Choose the parameters separately for each template, e.g. using a p^-dependent scale for template matching. 
Based on a combination of these two criteria, we fix a a (for the ath parton) by that parton's transverse momentum, 

<r a =pr,a/3- (4) 
We use a subcone of radius r 2 = 0.3 for the two-body template analysis, while the three-body subcone is dynamically 



determined on a template-by-template basis. Section II B contains a detailed discussion on the optimal scaling of 



In principle, one can re-sum the soft radiation from each of the partons. Here we simply stick to a fixed order perturbation theory 
description. 
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subcone radius. We should nevertheless emphasize that the overall performance of the method can be maximized for 
a wide range of r by rescaling other parameters. 

Next, we generate libraries of O(10 4 ) 2-body templates and O(10 6 ) 3-body templates in steps of Higgs px of 
30 GeV starting from 315 GeV. For each event, we achieve the best signal sensitivity by dynamically selecting a set 
of templates based on the transverse momentum of the W boson, 



where p T l ^ lin , Pxmax are tne limits of the bin corresponding to the template set with p^ mp — (p^max — Prmm)/^- ^ OT 
instance, an event with p^ = 320 GeV would be analyzed by a template set with pt = 315 GeV etc. We give more 
details on template properties and generation in the Appendix. 

Replacing the criteria for the template set selection from jet pr to p^ has a enormous advantage when considering 
effects of pileup, however it is not the only choice. As an alternative, one could analyze each jet with all template 
sets, but at a huge expense in computation time. 

We use the TemplateTagger [3T] numerical package for the template matching analysis]^] The package is a CH — h 
code which provides basic implementation of the Template Overlap Method for jet substructure, as well as several 
other jet analysis tools. 



Template overlap provides a mapping of final states j to partonic configurations f[j] at any given order in perturba- 
tion theory. Once the best matched template f[j] is found, it can be used to characterize the energy flow of the state, 
giving information on the likelihood that the event is signal or background. The scope of template overlap does not 
stop with Ov2 and Ov^. Peak templates contain additional information about correlations within the fat jet, some of 
which we explore in the following sections. 



Of particular value are angular correlations between template momenta which can otherwise be concealed in the 
numerical values of the peak overlap. For instance, the angular distribution of jet radiation can be measured with the 
variable 9 [53], defined as 



where ARij is the distance in the (ry, cf>) plane between the i th template partonic momentum and the jet axis. When 
measured using three-body templates, the 9 variable exploits the fact that signal events tend to have smaller emission 
angles. Notice that for highly boosted jets, the 2-body version of 9 simply reduces to the angle between the two 
partons [23] . 




(5) 



A. Other Peak Template Observables 



1. Angular Correlations 




(6) 



2 Publicly available at: 
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2. Template Planar Flow 

Another useful background discriminant is Planar Flow [23J[5S] (see Ref. [25] for a recent study of the Pf distribution 
of QCD massive boosted jets). To define the Pf variable we first introduce the "jet inertia tensor" as 



-'--£#• (7) 

mj ^ pf 



Here mj is the jet mass, and i runs over the jet constituents. We choose to use the boost-invariant definition for I kl , 
whereby we define the two dimensional vectors p\ as 

p?=pf(»?i»&). (8) 
with rji, (pi are measured relative to the jet axis. The Pf jet shape is given by 



Pf=iM2 ( 9 ) 
tr(7) 2 ' 



Notice that because of the fact that the trace of (l kl ) is proportional to the jet mass [32], Planar Flow is only well 
defined for massive jets |29j . Planar flow is particularly helpful in distinguishing energy flow distributions which lie 
on a line (Pf — > 0) from uniformly distributed energy flow (Pf — > 1). For instance, Planar Flow of a boosted Higgs 
will tend to be smaller than that of a massive QCD jet that in turn will be smaller than that of a boosted top or 
gluino [33] . 

Planar flow of a jet is useful when considering only the hard and coherent part of the jet. Because of high sensitivity 
of jet Planar Flow to pileup and UE, here we consider Template Planar Flow (tPf) as an alternative. The advantage 
of tPf is that it is constructed purely out of peak template states and thus less susceptible to pileup. We demonstrate 
this point using a Monte Carlo simulation in Section |IIIB| The non zero mass of the peak template states guarantees 
infra-red safety of tPf. We define tPf using peak template momenta as well as the template subcones to include 
physical effects of energy smearing. Appendix [A"] gives a more detailed discussion. 



3. Template b Identification 

A simple but useful way to reject backgrounds and accept signal events is to incorporate the information related to 
6-tagging into the template overlap framework. The identification of 6-tagged jets relies on information beyond what 
is provided by the calorimeters (say from the presence of a displaced vertex and/or a hard lepton). This information 
(e.g. direction of the 6-tagged jets in r\ and <fi) is fairly uncorrelated with the information about the direction of the 
peak template partons. We integrate the ^-tagging information into the template overlap framework by assigning a 
6-quark tag, t b to each peak template f[j]. A two body template parton is assigned a &-tag if an anti-fcr (r = 0.4) 
6-jet lies within a template cone of radius r 2 around the template parton axis. For simplicity, we take a jet to be 
6-tagged if it has pt > (default: p^ s = 20 GeV) and contains a b or b quark. 

Information about template &-tags can be of particular use in discriminating the large tt background. For instance, 
consider a typical doubly-6-tagged jet coming from a tt event. A fragment of another light (or c) jet is likely to fall 
in the cone of R ~ 1. If only the criterion of a doubly fo-tagged jet is used, there is no guarantee that the peak two 
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body template will select the two b subjets, making the it event more likely to pass the kinematic constraints of 
the template states. On the contrary, if we require that the 6-tagged jets coincide with the template momenta, we 
discriminate against jets in which only one 6-jet is tagged by the template. Notice that the effect of ^-tagging on the 
templates should not have a large effect on the signal. 



4- Template Stretch 



In order to increase the signal efficiency of the Tempate Overlap Method we chose a working point where the energy 
resolution of each of the template partons is rather loose (i.e. a a = Pr,a/3). This implies that even after both OV2 
and OV3 cuts the mass distribution for the background events is still broad and certainly more spread than that of the 
signal. This, as well as the fact that for template 6-tagging we require a large r = 0.4 anti-fcr jet parameter (motivated 
by the current experimental defaults), implies that the angular distance between the partonic, "6" candidates with a 
high Ov2 score would still have some smearing with respect to the actual distance between the two anti-fcy 6-tagged 
jets. To capture this effect, we define a new observable, Template Stretch, as 

= ^r- ( 10 ) 

bb AR t K ' 

where AR t is the distance between the peak two-body template momenta and A.R b i is the distance between the two 
6-tagged subjets. We expect that the background events will have a broader distribution of compared to the 
signal events. The functionality of the template stretch is correlated with the mass of the jet, but with an important 
advantage. The mass of a fat jet is subject to a large jet cone radius R ~ 1, making it highly susceptible to effects of 
pileup and underlying event. Since S^jj is constructed out of subjets with r = 0.4 and template states, it is bound to 
be less sensitive to pileup. We will illustrate this point further in the following sections. 



B. Varying Subcone Templates and Showering Correction 



Fixed template cones are limited by the fact that different pt subjets yield a different energy profile in 77, (f>. This 
fact is important for three body template analysis, where we expect the px of the three peak template partons to 
be non-uniform. For instance, one would find that a template subcone of radius r = 0.05 is adequate to capture 
the radiation pattern of a 500 GeV quark. Yet, the same subcone would completely fail to adequately describe the 
radiation pattern of a quark with energy of 100 GeV, resulting in a poor overlap score. We thus introduce the concept 
of scaled three body subcones into the template overlap framework. Varying template subcones allow us to correct 
for energy deposition outside the template subcone radii. This in turn leads to an an improved template-level energy 
resolution while keeping systematic uncertanties well under control^] Jets become narrower as pr increases, meaning 
that a smaller jet area is needed to collect some fixed fraction of the jet energy at higher transverse momentum. The 
corresponding distribution of jet areas (at a fixed energy fraction) is generically non calculable but is measured by 
experiments, and is commonly denoted as the "jet shape" variable. Refs. [31] and [35] present ATLAS and CMS 
studies of the jet shape variable respectively. 



3 At low pt, the leakage of QCD radiation outside the template subcones can be especially large when using fixed subcones. To account 
for this, one has to include energy correction factors into the px of the templates, which are largely affected by systematic uncertanties. 




FIG. 1. Jet shape results on optimal template subcone scaling. The blue, solid curve shows the minimal anti-fcr radius 
necessary to capture 80% of jet energy, obtained from ATLAS data. The grey, dashed curve is the ATLAS result shifted down 
by Sr — 0.063, in order to match the optimized value of r3(100 GeV) = 0.12 (large, red dot). The orange, dotted line is the 



naive scaling of Eq. (111. 



We used the ATLAS jet shape study in Ref. [31] to establish a scaling rule for the template subcones. The 
differential jet shape drops rapidly as r increases: at low px, more than 80% of the transverse momentum is contained 
within a cone of radius r = 0.3 around the jet direction. This fraction increases up to 95% at very high px- We fit 
the numerical values of the integrated jet shape in different px regions to obtain the minimum cone radius required 
to capture 80% of the jet transverse momentum, fn the overlap analysis, we correct for the 80% efficiency by scaling 
the template px accordingly. Fig. [T] shows the result of our dynamical scaling. The points represent the minimum 
radius necessary to capture 80% of jet's energy as a function of jet px- The resulting curve gives a shape to the 
optimal scaling rule for template subcones. The error bars on the data points are small enough that they can be 
omitted for the purpose of our analysis. To obtain the subcone values in the px < 30 GeV region, we extrapolate 
the data. An overall shift in the jet-shape curve remains a free parameter. To calibrate it we choose a benchmark 
point of ^(100 GeV). We demonstrate in the following section that other calibrations are possible and may perform 
in a similar manner, within a reasonable range. The dotted curve shows that a shift of Sr = 0.063 units provides an 
excellent fit. Notice that the naive scaling 

r 3 (^)=r 3 (100GeV)^-^, (11) 

Px 

is in excellent agreement in the px > 60 GeV region, while the discrepancy with data becomes large at lower jet px- 
Fig. [2] shows an example of the effect of varying subcones on peak templates compared to fixed ones. The blue, 
dotted circles represent the peak two body template with radius ri = 0.3. The red, dashed circles are peak three 
body templates with a fixed r$ = 0.12 (left panel) and varying r% (right panel). Notice that a higher percentage of 
the lowest px subject is "encompassed" by the varying cone resulting in an overall increase in the OV3 score. 

Varying subcones improve the performance of templates on the distribution level as well. Fig.[3]shows a comparative 
example for Ov%. The panel on the left was obtained using a fixed r^ — 0.12 while r^ was allowed to vary according to 
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(0v2 = 0.886001 , Ov3 = 0.705925) (Ov2 = 0.886001 , Ov3 - 0.770652) 




FIG. 2. Template analysis of a boosted Higgs jet. Left panel shows a Higgs jet analyzed with fixed three body subcones of 
radius r% — 0.12. The right panel corresponds to a repeated the analysis using the scaled subcones shown in Fig. [I] Grey 
squares represent the jet constituents, the pr of which is proportional to the size of the square. The solid circles are positions 
of b-quarks in the hard process. 
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0.2 0.4 0.6 0.8 1.0 

Ov 3 {p T >300 GeV, Ov a >0.S) 




0.2 0.4 0.6 0.8 

Ot) 3 (j» r >300 GeV, Ov 2 >0.S) 



FIG. 3. Template observable distributions for doubly-fo-tagged events with Ov2 > 0.8 and 6-tagged 2-body templates (r2 = 0.3) 
for p™ > 300 GeV and no mass cut. Jets are reconstructed using FastJet, and the anti-fey algorithm with a varying cone size 
R (here denoted "ankt". See more details on R scaling below). The left panel represents distributions with = 0.12. The 



right panel represents the same distributions using the scaled subcones of Eq. (Ill 



the scaling rule of Fig.[T]in the right panel. Notice that the varying subcones result in background overlap distributions 
which are significantly more peaked in the region of low overlap values. On the other hand the signal remains mildly 
affected, resulting in improved performance. 



C. Stability of the Template Overlap Method 



How sensitive is the Template Overlap Method rejection power to the choice of subcone radius r s h ap e = ^(lOOGeV^) 
and the overall normalization of the working curve of Fig. [T]? Significantly modifying the overall scale for subcone 
radius r s h a pc does alter the distribution of our kinematic variables; in particular, reducing the size of r s h ap c tends 
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to shift Ov2 and OV3 to lower values and vice versa. The sensitivity of the template method to the specific choice 
of r s hapc calls for a dedicated experimental study on a control sample to fix the corresponding value. One can use 
either boosted top analysis in the signal area (in fact the Ov^ distribution of boosted tops and their backgrounds was 
already studied experimentally by ATLAS in Ref. [21]) or looking at hadronic W inside a boosted top jet as a way 
to experimentally analyze the above dependence. The sensitivity to r s h ap e ls of course not unique to our proposal, 
as even if fixed cones are used, all jet substructure methods will depend on the choice of the corresponding subcone 
parameters. Varying the subcone size to keep the enclosed energy fraction fixed is in fact a more covariant way to 
proceed with the substructure analysis. We further wish to point out that it is in general quite possible to choose 
values of cuts on Ov2 and OV3 such that the overall signal and background efficiencies are essentially the same for 
different subcone radii. To illustrate this point, in Fig. [4] we show curves of rejection power for several choices of the 
subcone parameter r s h ap c- The meaning of r s h a pc m Fig. [4] is defined as the choice of ^(100 GeV), while the actual 
value of r3 is allowed to vary according to the scaling rule in Fig.[T] The blue squares represent rejection power against 
the Wbb background as a function of the three-body subcone radius ^(lOOGeV), and a fixed Ov^ > 0.6 cut, while 
the Ot>3 cut is allowed to scale with r s h apo to provide a five percent efficiency for each r s h apo ■ Signal and background 
efficiency of a fixed Ov 3 cut shift with the choice of r s h apc , but disproportionately. Rigid Ov 3 cuts thus show high 
sensitivity to the choice of r s h apc as shown by the blue squares in Fig. [4] Alternatively, fixing signal efficiency also 
fixes the background efficiency for a wide range of r s h ap e, thus preserving the rejection power over a wide range of 

^"shape (^-G. ^shapc ^ 0.12). 



Stability of overlap rejection power 
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■ ■ Rigid Ov3 cut 

a a Variable Ov3 cut (Eff. = 5 %) 
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FIG. 4. Sensitivity of Wbb rejection power to the choice of radius r a h apo = r3(100 GeV). The blue line shows rejection power 
with a rigid Ovz > 0.6 cut. The green line is the rejection power with a cut on Ov?, varied with r s h ap c, while keeping a fixed 
efficiency. The rejection power is relative to cross sections with no Basic Cuts (see next section). 
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Pythia vT = 8 TeV, ankt R=1A 
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FIG. 5. The left panel shows the correlation between the Higgs-jet pr and the pr of the associated W for a cone of fixed size 
R — 1.4. The right panel shows the same correlation with the jet cone rescaled according to p^ as in Eq. (121. 



III. DATA SIMULATION AND ANALYSIS 

In this section, we investigate the tagging efficiencies for Higgs jets and the mistag rates for QCD backgrounds using 
template overlap. We consider events in which a boosted H — > bb jet is produced in association with a leptonically 
decaying W boson (only first two generations of leptons). The most dominant backgrounds for this process come 
from tt and Wbb, while other channels do not significantly contribute after 6-tagging requirement. 

The scope of our analysis includes data at both s/s = 8 TeV and y/s = 13 TeV with and without pileup. We 
use MG/ME V5.1.33 [36 interfaced to Pythia v6 [37] (with MLM matching [38]) as well as Sherpa vl.4.0 [39] 
(with CKKW matching [5D]) for data generation and the CTEQ6L1 [IT] parton distribution functions. We perform 
jet clustering with a Fastjet [12] implementation of the anti-fey jet clustering algorithm. To simplify the notation 
from here on we solely refer to Pythia and Sherpa results and leave the fact that all our data samples are matched 
implicit. Our Sherpa simulations serve to illustrate the effects of various showering algorithms. We only include 
the Wh and Wbb data in the comparison, as the tt distributions are characterized by hard scales and therefore less 
sensitive to detail of the showering. We scale the fat anti-fey jet radius according to the W momenta 

200 GeV 



R = max 1.4 



w 
Pt 



,0. 



(12) 



where p^ is the transverse momentum of p l + p'x ■ Note that the momentum of the W is highly correlated with 
the momentum of the Higgs as they recoil against each other. However, scaling the cone according to pt^ has an 
advantage in that it is not susceptible to pileup. 

Continuing, we limit the value of R from below to be higher than 0.8 as to be able to accommodate two 0.4 anti-fey 
jets used for 6-tagging. The scaled fat-jet cone fulfills three tasks; it is designed to capture the bb at a fixed efficiency 
rate of ~ 80% ; it reduces the amount of contamination from soft radiation and pileup for events with high px Higgs 
jets; and it lowers the overall tt background. Note that the scaling rule of Eq. p"2| has only a minor effect on the 
correlation between the Higgs fat jet momenta and that of the W as is shown in Fig. [5] 

Next,we normalize our data to next-to-leading-order (NLO) cross sections obtained from MCFM 6.3 [43 . The cross 
sections assume a fixed renormalization/factorization scale of n = pip m — 300 GeV at 8 TeV and p = pip m = 400 GeV 
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at 13 TeV and CTEQ6.6M partem distribution functions. For each event, we find the jet with the highest 
transverse momentum j and impose the following Basic Cuts: 



P J T > PT ■ 

r)j,r)i < 2.5 . 
N b = 2 : 

N k (p T > 20 GeV) < 2. 



i> T > 40 GeV , 
ARbb > 0.4 , 
Ni{p T > 20 GeV) = 1 



(13) 



where Nk,i is the number of jets (anti-fcr, r = 0.4) and leptons outside the highest px fat jet (of radius R) and Nf, 
is the required number of ^-tagged (anti-fey T — 0.4) subjets. For a fat anti-fey jet j (of radius R), and an anti-fey, 
r = 0.4 jet k, a jet is considered to be outside the fat jet if the plain distance AR(j, k) > R + r. Similarly, a lepton I 
is considered outside if AR(j, I) > R. Table [I] summarizes the cross section results with and without Basic Cuts. 

We consider p™ m = 300, 350 GeV respectively at 8 TeV and 13 TeV. For ^-tagging we assume an efficiency of 75% 
and fake rate of 1% for light jets [45]. The current studies suggest a charm fake rate of 18% [45], which is likely a 
conservative estimate. Charms are extremely important when considering boosted Higgs decays, as the largest part 
of the ti background comes from events in which one top decays leptonically, while the hadronic W from the other 
top decays to a charm. We emphasize that omitting the charms as a source of background (as is done in some of the 
boosted Higgs analyses) will result in an improved performance for our tagger. Yet, at present, it is not clear whether 
this is possible, and the burden of proof is thus placed on the experimental collaborations. 

We analyze the cases with and without pileup separately in order to illustrate the sensitivity of the Template 
Overlap Method to a pileup environment. This allows us to determine the range of background rejection power as a 
function of the efficiency of pileup subtraction. In addition, it also allows for a comparative study of the various jet 
substructure observables in a pileup environment. 



fb 


ti 


Wbb 


Wh 


S/B 


a{ = 8 TeV, p^ > 300 GeV) 
ct( y/s = 8 TeV, Basic Cuts) 


565.0 
2.0 


56.0 
2.5 


1.6 
0.2 


0.05 


<r( y/s = 13 TeV, > 350 GeV) 
cr( y/s = 13 TeV, Basic Cuts) 


956.0 
3.0 


47.0 
1.7 


1.2 
0.3 


0.06 



TABLE I. NLO signal and background cross sections at y/s = 8 TeV and y/s = 13 TeV. The listed numbers assume a 
leptonically decaying W with first 2 generations of leptons included. Basic Cuts include Eq. (131 as well as the 6-tagging 
efficiencies. 



A. Higgs Tagging with Template Overlap - No Pileup 

We proceed to discuss the ability of the Template Overlap Method to discriminate between different sources of 
coherent QCD radiation. 

In terms of pileup filtering, analysis without pileup is equivalent to stating that the efficiency of pileup subtraction 
is 100%. In this section we present only the results on jets with pr > 300 GeV, simulated at y/s = 8 TeV, while 
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we postpone the discussion of the future 13 TeV LHC run until upcoming sections. The first important feature of 
the Template Overlap Method is that it is designed to identify a particular kinematic jet substructure configuration, 
including the jet px and mass. High peak overlap score implies that the kinematics of a fat jet matches the kinematics 
of the peak template state. In Fig. [6] we plot the jet mass distribution without (left panel) and with (right panel) 
template overlap cuts of Ov 2 > 0.9 and Ov^ > 0.8, after the Basic Cuts of Eq. ( 13 ) have been applied. It is evident 
from Fig. [6] that sizable chunk of the background is removed as a result of the overlap cuts though the resolution of 
the fat jet Higgs mass is only moderately improved. 



Pythia vT = 8 TeV, ankt R-l/jf 



Pythia vf= 8 TeV, ankt R~l/p% 



1 1 Wh 
1 1 Wbb 

□ ft 




























































J 
















j 1 













40 60 80 100 120 140 160 180 200 
m(Basic cuts) 



Wli 
Wbb 




40 60 80 100 120 140 160 180 200 

m(Oi> 2 >0.9, 2ij, Ou 3 >0.8) 



FIG. 6. Invariant-mass peak searches with templates. The left panel shows the mass distributions with no cuts on template 
observables. The right shows the same distributions when overlap cuts are applied to the data: Ov2 > 0.9 and 0«3 > 0.8. 



The mass resolution of the peak templates depends largely on the chosen parameters of the method, namely the 
template cone radii r a and their energy resolution a a which we choose rather loosely as to keep the signal efficiency at 
a reasonable level. Recently, authors of Ref. [5T] presented a jet substructure analysis of boosted ti pairs at ATLAS. 
Their results showed that even with a high peak overlap cut, an additional mass window improved the background 
rejection power by a factor of two. Our result in Fig. [6] agrees with the ATLAS result. It appears that even after 
the overlap cuts, a mass window of (say) 110 GeV < rrij < 130 GeV would improve the background rejection power 
(this would, however, require an additional procedure of pileup removal). In the following sections we remain agnostic 
about this issue and show results with an without a mass cut. 



Fig. [7] shows distributions of several template-inspired observables obtained from both the Pythia and Sherpa 
data. Since our focus is on the difference in the shapes of various observables, all of the kinematic distributions are 



shown after cuts slightly different from the ones of Eq. (14). Ref. [23 showed that at very high px (say above the 
TeV scale), 8 displays a sizable rejection power. Our result shows that at lower px, in the region where small R 
approximation does not hold, the background discriminating power of 8 is highly diminished. In addition to Ov^, tPf 
and especially Ai?bb/ ARt appear to be promising variables. We discuss tPf in more detail in Appendix |A} 





FIG. 7. Distributions of various substructure and template-based observables obtained from Pythia (left) and Sherpa (rij 
Basic Cuts are applied to all panels in addition to the cuts specified in the label. 
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1. Background Rejection Power at y/s = 8 TeV 

We proceed to discuss the rejection power of the method for jets with px > 300 GeV at y^I = 8 TeV. For the 
purpose of illustration, we consider several combinations of cuts on both template and jet observables, while we leave 
Ov™ m a free parameter. We label the cuts as following: 



Cuts 1 : Ov 2 > 0.9. 

Cuts 2: Ov 2 > 0.9, 2t b . 

Cuts 3 : Ov 2 > 0.9, 2t b , AR bb /AR t < 1.0. 
Cuts 4 : Ov 2 > 0.9, 2t b , tPf < 0.3. 

Cuts 5 : Ov 2 > 0.9, 2t b , AR bb /AR t < 1.0. 

110 GeV < m < 130 GeV, 



(14) 



where 2t b denotes that both two-particle peak template momenta are 6-tagged. 
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FIG. 8. Background rejection power of the template analysis. The left panel shows the overall efficiency and fake rate with 



fixed cuts of Eq. (141. A cut on On™" 1 runs along the curves. The right panel shows rejection power for the tt and Wbb 



separately for Cuts 1. All efficiencies are relative to Basic Cuts of Eq. ( 13 1 



Fig. [8] summarizes the results. The left panel shows rejection power obtained from our analysis, with and without 
a mass window cut. The signal efficiency and fake rates are measured relative to the cross sections with Basic Cuts 
from Table |Ij The curves also include ^-tagging efficiencies we discussed the previous section. Each curve represents 
fake rate as a function of signal efficiency with a set of fixed cuts on template observables, while a cut on Ov^ in 
the range of (0, 1) runs along the curve. Our results show that template observables can significantly improve the 



background rejection power relative to Basic Cuts of Eq. (13). Fig. 10 illustrates the rejection power over individual 
background channels. Template Overlap method alone performs significantly better in rejecting Wbb events for most 



signal efficiencies, as shows in the left panel of Fig. Fig. 10 This is reasonable since tt events typically consist of two 
6-tagged jets and an additional fragment of a hadronically decaying W boson. Such a configuration is more likely 
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to be tagged with a higher OV3 score than the typical two body substructure of a light QCD jet. However, cuts on 
additional kinematic observables such as the Template Stretch or tPf, as well heavy flavor tagging requirements, can 
result in ti events being rejected at a rate higher than Wbb. 
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FIG. 9. Background rejection power of the template analysis for individual background channels. The left panel shows the 
overall efficiency and fake rate for ti and Wbb separately with Cuts 1 of Eq. (141. A cut on O113"™ runs along the curves. The 



right panel shows the signal efficiency and fake rate for Cuts 3 of Eq. ( 14 \ . All efficiencies are relative to Basic Cuts of Eq. ( 13 1 



Table [TT] shows an example of benchmark signal efficiency points. At 14% efficiency, a rejection factor of w 4 is 
achievable with template based observables only. Additional mass window boosts the rejection power by rs 25%, 
leading to a S/B « 0.3, with roughly 20% efficiency. 

Template Overlap Method can achieve enough rejection power at 8 TeV to overcome the backgrounds at the cost 
of signal efficiency. Yet, the current estimates for integrated luminosity of the LHC 8 TeV run are not enough to yield 
practical results, even if we could combine the CMS and ATLAS data. We thus turn to projections for the future 
13 TeV run. 



2. Background Rejection Power at y^i = 13 TeV 

The composition of background channels at y/s = 13 TeV changes relative to 8 TeV, with ti amounting to 60% 
of the total. Higher center of mass energy also allows us to push the minimum px to higher values; we opt for 
Pt > 350 GeV. Fig. [10] shows the result. We again find that template overlap can significantly improve the rejection 
rate over traditional jet observables. The overall performance of templates improves relative to y/s — 8 TeV and 
Pt > 300 GeV, as expected. 



Fig. 10 shows that an overall rejection power of « 10 (Eff: 20%) is achievable at = 13 TeV, leading to an overall 



S/B ^ 0.5. This constitutes an improvement over the 8 TeV result where the maximum rejection power was ~ 5. 
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PYTHIA 



SHERPA 



Cut Set 


Ovf™ 


Wh efficiency (%) 


Wbb fake rate (%) 


tt fake rate (%) 


overall rejection power 


Cuts 3 


0.3 


39.0 


20.0 


16.0 


2.1 


Cuts 3 


0.8 


27.0 


10.0 


10.0 


2.7 


Cuts 4 


0.3 


20.0 


9.0 


4.0 


3.0 


Cuts 4 


0.8 


14.0 


5.0 


2.0 


4.0 


Cuts 5 


0.8 


23.0 


6.0 


3.0 


5.0 


Cut Set 


Ov^ m 


Wh efficiency (%) 


Wbb fake rate (%) 


tt fake rate (%) 


overall rejection power 


Cuts 3 


0.3 


39.0 


17.0 




2.3 


Cuts 3 


0.8 


26.0 


8.0 




3.1 


Cuts 4 


0.3 


23.0 


11.0 




2.1 


Cuts 4 


0.8 


15.0 


5.0 




2.9 


Cuts 5 


0.8 


17.0 


4.0 




4.3 



TABLE II. Background Rejection Rates at ^J~s — 8 TeV. The values in the table show the signal efficiencies and fake rates 



relative to the cross sections with Basic Cuts of Eq. ( 13 1. The overlap rejection power includes both the Wbb and tt. 



Pythia A- = 13 TeV, anktii:=0.8 
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efficiency (%) 



40 



FIG. 10. Background rejection power of the Template Overlap Method at ^/s — 13 TeV. The left panel shows the overall 



efficiency and fake rate with fixed cuts of Eq. (14 1. A cut on Ou™ m runs along the curves. All efficiencies are relative to Basic 



Cuts of Eq. ( 13 \ 



B. Higgs Tagging with Template Overlap - Effects of Pileup 

A foe to most jet substructure observables, pileup has become an LHC fact of life. The strive for high luminosity 
resulted in pileup levels of whopping 20 average interactions per bunch crossing during the current 8 TeV run. Pileup 
events contribute both to the fat jet constituent multiplicity and the energy distribution within a jet, resulting in 
possibly dramatic effects on any jet substructure observable constructed out of jet constituents. 

Jet mass is perhaps the best illustration of this point. Fig. [TT] shows an example. The left panel shows mass 
distributions in the presence of average 20 interactions per bunch crossing. In addition to shifting the mass peaks 
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Pythia 



Cut Set 


OvT n 


Wh efficiency (%) 


Wbb fake rate (%) 


tt fake rate (%) 


Overall Rejection Power 


Cuts 3 


0.3 


35.2 


19.1 


11.8 


2.4 


Cuts 3 


0.8 


27.0 


10.5 


10.0 


3.2 


Cuts 4 


0.3 


20.4 


8.5 


7.6 


2.6 


Cuts 4 


0.8 


15.3 


4.5 


4.2 


3.6 


Cuts 5 


0.8 


22.1 


5.1 


1.3 


8.2 



TABLE III. Background Rejection Rates at ^/s = 13 TeV. The values in the table show the signal efficiencies and fake rates 
relative to the cross sections with Basic Cuts. The Overall Rejection Power includes both the Wbb and tt. 
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FIG. 11. Jet mass distributions for signal and dominant backgrounds with pileup, JV v tx = 20. The left panels show distributions 



after the Basic Cuts of Eq. ( 13 1, while the right panel shows the same distributions with Ov2 > 0.9 Ov^ > 0.' 



by as much as 30 GeV (relative to the plots in Fig. [6]) pileup reduces the mass resolution. In a recent study of Ref. 
[21] . the ATLAS collaboration tested this feature with the LHC 7 TeV data. The mass resolution does not improve 
significantly even after cuts on the overlap are applied, again due to a low energy scale resolution. Note that reducing 
the value of a a and thus increasing the mass resolution of the Template Overlap Method could be used to study effects 
of pileup on jet observables. 

Many "post processing" pileup subtraction techniques exist in the literature, such as trimming |18j , pruning |19) and 
jet area techniques [351 US] ■ Alternatively, Ref. [UJ presents a data driven method of correcting for pileup effects for 
jet shape variable of massive narrow jets. Finally, particle tracking information can be used to subtract pileup events, 
a method already used by the CMS collaboration [IS]. In this section we do not consider any pileup subtraction. 
Instead, we show that the Template Overlap Method is largely unaffected by pileup. 

Robustness of the Template Overlap Method against pileup comes from the definition of template overlap. Consider 
for instance a single template momentum p . The core of the overlap measure is the difference 

S PT =Pt~J2p't x Ofa - AJ2y), (15) 

j 

where p J are momenta of jet constituents and 9 selects the ones which fall into a cone of radius r around p . The size 
of the template subcone r% thus limits the effects of pileup. 
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FIG. 12. Comparison of boosted Higgs template results without and with pileup (20 average interactions per bunch crossing). 
Left panel shows a Higgs jet analyzed with no pileup. The right panel is the same jet with pileup added. Grey squares represent 
the jet constituents, the pr of which is proportional to the size of the square. The solid circles are positions of 6-quarks in the 
hard process. Particles with pr < 1 GeV are not shown on the plot, but are included in the analysis. 



Matching px of a template state to the jet pr with pileup is problematic. Jet pt is shifted to higher values by pileup 
and as such is inappropriate as a criterion for template selection. Furthermore, a lower cut on the jet pt with pileup 
will include jets which would not pass the cut if pileup was not present. We instead use the px of the leptonically 
decaying W boson, as an infra-red safe, pileup independent observable (recall that since the Higgs recoils agains a W 



boson, pw ~ Ph, a s we showed in Section III I. 

We simulate the effects of pileup by adding a random number of minimum bias events (MBE) to every event we 
analyze. The number of added MBEs is distributed according to a Poisson distribution with the mean -/V vtx = 20, 



consistent with the LHC conditions at y/s — 8 TeV. Fig. 12 shows an example of the pileup effects on the overlap 
analysis of a single event. The analysis of a jet with no pileup (left panel) yields nearly identical peak template 
state as the jet with pileup (right panel). Notice that the overlap scores remained within ~ 10% of each other. On 
distribution level, the situation is similar. Fig. [13] shows examples of several template based observables with and 
without pileup. Even at 20 interactions per bunch crossing, we find no significant effects of pileup on the distribution 
shapes. In fact, the difference between susceptibility of tPf and Pf to a pileup environment is striking. While jet 
Planar Flow is significantly shifted to higher values by pileup, template Planar Flow remains mostly unaffected. 



1. Rejection Power 

The overlap method used in this study effectively reduces the area of fat jets and is therefore less sensitivity to 



pileup. As a simple application of this idea, in Fig. 14 we show fake rate vs. efficiency with the cuts of Eq. (14 1, 
obtained from Pythia data at y/s = 8 TeV and N vtx = 20. For a given cut, the efficiency is controlled by the lower 
cut on Oi>3. The results depend on the choice of cuts, but it is clear that the overall performance of overlap approach 
remains largely unchanged when compared to the case of events without pileup. Our final results can be summarized 
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FIG. 13. Effects of pileup on various jet-substructure distributions. Solid distributions were obtained with no pileup, while 
dashed distributions contain 20 average pileup events. All plots have a cut of Ov2 > 0.9 for T2 = 0.3. 



in Tabic |TV| for a few benchmark efficiency points. We choose to omit the result of adding a mass cut since it requires 
an additional mechanism for pile up subtraction, and is thus beyond the scope of this paper. In each case, we find 
background rejections comparable to our results for events wihout pileup. 



IV. CONCLUSIONS 

Hadronic decay channel of the Standard Model Higgs boson is one of the most challenging measurements in Higgs 
physics at the LHC. Traditional jet observables such as jet mass and pr are inadequate to combat the large QCD 
background as well as high luminosity environments characteristic of the LHC. Jet substructure techniques can be 
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Cut Set 


OvT n 


Wh efficiency (%) 


Wbb fake rate (%) 


ti fake rate (%) 


overall rejection power 


Cuts 3 


0.3 


37.9 


18.6 


15.8 


2.1 


Cuts 3 


0.8 


25.1 


8.7 


10.1 


2.9 


Cuts 4 


0.3 


24.5 


9.7 


9.9 


2.5 


Cuts 4 


0.8 


16.1 


4.6 


5.1 


3.4 



TABLE IV. Background Rejection Rates at ^/s = 8 TeV with AT xvt = 20. The values in the table show the signal efficiencies 
and fake rates relative to the cross sections with Basic Cuts. The overall rejection power includes both the Wbb and ti. 




FIG. 14. Background rejection power of the Template Overlap Method at y^i = 8 TeV with iV v tx = 20. The figure shows the 



overall efficiency and fake rate with fixed cuts of Eq. (141. A cut on Ov™ n runs along the curves. All efficiencies are relative 



to Basic Cuts of Eq. 1 13 1. 



used to overcome the large backgrounds in the boosted Higgs regime. We have demonstrated that Template Overlap 
Method is able to deliver sufficient background rejection power for a viable boosted Higgs search at s/s = 13 TeV. 
The method is relatively insensitive to contamination from pileup at an average of 20 interactions per bunch crossing. 
We introduced several improvements into the template overlap framework, which include varying the three-body 
subcones, sequential template generation, integration of 6-tagging identification into the peak templates and a new 
useful substructure variable denoted as template stretch. Future studies of the boosted Higgs in the framework of 
template overlap would benefit from a detailed detector simulation and more realistic 6-tagging. 



V. ACKNOWLEDGMENTS 



We thank Raz Alon, Ehud Duchovni, Ohad Silbert and Pekka Sinervo for useful comments and encouragement and 
especially Jan Winter for an early collaboration on this project. We also thank Gavin Salam for useful comments on 
the TemplateTagger code, and Ciaran Williams for discussions about MCFM. MB and JJ would like to thank the 
CERN theory group for their hospitality, where a portion of this research was conducted. The authors also express 
their gratitude to Pierre Choukroun and Lome Levinson for their enormous help regarding the computing needs of 



22 



this work. GP holds the Shlomo and Michla Tomarin development chair and is supported by the Gruber foundation, 
IRG, ISF and Minerva. 

Appendix A: More on Template Planar Flow 

To illustrate how energy smearing affects the parton level tPf consider a peak three-body template state with an 
inertial tensor I t . Energy smearing affects only the diagonal elements of I t while off-diagonals remain unchanged due 
to symmetry, leading to 

I t ^I s = I t + ^aI, (Al) 

where Pt — X)i=i Pt ano - Pt i s the transverse momentum of the i th template momentum. The effects of energy 
smearing are summarized in the symbol 



a = J <mg(pi), (A2) 



where g{pi) is the energy smearing distribution. 

For simplicity, we consider a uniform distribution over a disc of radius r 3 , giving 

r 2 

cw = y- (A3) 

Alternatively, one could also consider a Gaussian distribution centered around the template momentum, with a width 
a, resulting in 

acauss = er 2 . (A4) 

Continuing, the determinant of I s becomes 

det(J a ) = det(J t + ^ al) 

= dct(/ t (I + / t - 1 ^I)) 

= dct(/ t )dct(I + It lP ^aT). (A5) 



Similarly, the trace becomes 



tr(/ s ) - tr(J t + ^al) 



= Tr(I t )+2^ [ a (A6) 

For a jet cone radius R <~ 1 it is reasonable to assume that a«l. Furthermore, Pt/M is typically of O(l), allowing 
us to expand det(/ s ) in a. Keeping only the leading term in both the trace and the determinant we get 



Mtr(/ t ) 

Since I t is by definition a 2 x 2 matrix, we can write 



tr < 7 "» " 5SE5 " HSW' <A8 > 
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FIG. 15. Effects of showering on Planar Flow of the template momenta. The blue solid curve shows the jet Pf distribution. 
The red solid curve shows Template Planar Flow (tPf) calculated using only the three momenta from the peak template. The 
green, dashed curve shows Planar Flow distribution of peak templates with a sub cone of r$ = 0.12 taken into account as in 
Eq. ( Al |. The black, solid line is the result of varying subcone radii according to the scaling rule in Fig. fl] 



Combining Eqns. (A7) and (A8) we finally obtain 



tPf, = tPf x 



1 + 7/tPf 
1+7 ' 



where 



4 P T a 
tr(Jt) M " 



(A9) 



(A10) 



As f3 — > 0, Eq. ( All ) correctly reduces to the expression for tPf. In the limit of tPf — > 1, the showering effects become 



irrelevant as well, keeping Eq. (A7) bounded in accordance with the definition of tPf. Notice that Eq. (All) is not 



valid in the tPf — > limit, as we made an assumption that det (it) ^ during its derivation. 

Continuing, in the narrow angle approximation tr(/ t ) = M /p l T , where p l T is the total template transverse momentum. 



We have verified numerically that this result is satisfactory even at R=1.4 We can then rewrite Eq. (A7) as 

y/tpf 



tPf, = tPf X 



1 



1 + 7' 



where 



M 2 



a. 



(All) 



(A12) 



Eq. (All) provides a useful qualitative insight into the effects of showering on Planar Flow. Template momentum 



configurations with low Planar Flow are more sensitive to showering effects. This is important when considering 
Planar Flow of Higgs jets which are expected to peak at low values. 
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Fig. 15 shows the results of a data analysis with par tonic tPf, and template Planar Flow calculated a in Eq. ( Al I, 
for both varying (tPf) and fixed r$. Addition of subcones to the Planar Flow calculation produces results which match 
the physical jet distribution much better than the parton level tPf(r = 0). Notice that the match between tPf and 
jet Planar Flow is excellent in the region of Pf > 0.4, where pcrturbativc expansions arc expected to hold. 



Appendix B: Efficient Generation of Template Libraries 



Template Overlap Method is a systematic framework aimed to identify kinematic characteristics of an boosted jet. 
A typical template configuration consists of a model template, /, calculated in perturbation theory, which describes 
a "prong-like" shape of the underlying hard sub-process of a jet. Template construction typically employs prior, 
theoretical knowledge of the signal kinematics and dynamics, as well as possible experimental input. 

The simplest template configurations are the ones describing the kinematics of two-body processes such as the 
decay of SM Higgs or W/Z bosons into quark- antiquark pairs. These are easily dealt with by assuming the rest frame 
of the parent particle and producing two decay products with equal and opposite, isotropically-selected momenta 
and magnitude, subject to energy conservation. The problem of a A-body decay subtracts four constraints from the 
decay products' 3N degrees of freedom: three for overall conservation of momentum and one for energy Q The final 
states can therefore be found on a (3N — 4)-dimensional manifold in the multi-particle phase space. Note that the 
dimensionality of the template space increases rapidly with additional patrons. For instance, the two-body templates 
require only two degrees of freedom, while a corresponding four-body template space is already eight dimensional. 

The question of which kinematic frame the templates should be generated in requires careful consideration. Authors 
of Ref. |23j argued that a search for the global maximum of Ovn could be too computationally intensive. To improve 
the computation time, the template states were generated in the Higgs rest frame using a Monte Carlo routine, and 
then boosted into the lab frame. While this method worked sufficiently well for tagging a highly-boosted object(i.e. a 
1 TeV Higgs jet), it introduced residual algorithmic dependence and a certain sense of arbitrariness in the jet shape. 
At lower pj- the Monte Carlo approach samples mainly the templates within the soft-collinear region, leaving other 
regions of phase space unpopulated. An enormous number of templates is required to adequately cover the phase 
space at px ~ 0(100 GeV), thus fully diminishing the motivation for a Monte-Carlo approach. The simplest and 
most robust choice is then to generate templates directly in the lab frame and then rotate them into the frame of the 
jet axis. The result is a well covered template phase space in all relevant boosted frames. In addition, the lab frame 
templates result in a significant decrease in computation time as a much smaller number of templates are needed. 

We proceed to show how to generate the phase space for 2- and 3-parton final states as well as how to generalize 
the results to arbitrary N. 



4 For our purpose, a template is an object with no other properties other than its four-momenta. 
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1. The case of 2-body templates 

First, we summarize our notation and conventions. The model template consists of a set of four vectors, pi, • • • ,Pn, 
on the hyperplanc determined by the energy-momentum conservation, 

X>=P, P 2 -M 2 , (Bl) 

i 

where M,P are the mass and four momentum of a heavy boosted particle,! e. the Higgs. For simplicity, we treat 
all template particles to be massless. We work in an (77, <f>, pr) space, where 77 is pseudorapidity, <fr azimuthal angle 
and pt transverse momentum. Without loss of generality, we can assume that the template points in the x direction 
(77 = <f> = 0). The templates are distributed according to 

Pi =PT,i (cos ^, sin ^j,sinhr7j, cosh r/j), i = 1,2,3 (B2) 

subject to the constraint 

AT 

J2Pr=P=(PT,0,0,Ej) (B3) 

1=1 

with Ej = \J M 2 +p T . We find it useful to define unit vectors by 

Pi = (cos fa, sin </>j,sinhr?j, cosh 77$), z = 1,2, (B4) 



so that k = pro- 
phase space for the 2-body decay processes is characterized by particularly simple kinematic parameters. To 
illustrate, first note that the 2-particle templates are uniquely determined by one single four momentum, pi subject 
to the condition 

(P-P!) 2 =0. (B5) 

Writing pi = Pt,iPi, we- can solve for p T]1 in terms of the angles of the first parton 

M 2 

^ = W^KY m 

We see that a 2-particle template is therefore completely determined in terms of the unit vector pi as follows: 

M 2 

- " W^f' m 

P2=P-P1- (B8) 

Note that we can represent such a template as a point (77, 4>) in 77 — <p plane. These are the two degrees of freedom, 
in accordance with the general result that the dimensionality of the TV template space is 3N — 4. 

2. The case of 3-body templates 

A space of five degrees of freedom allows for 3-particle templates to differ from one another in more than one way 
The 3-particle templates are determined by two four momenta, pi and p 2 , subject to the constraint, 



(P - Pi - P2) 2 = 0. (B9) 



2G 

Using pi = pt,iPi and P2 = Pt,2P2, we can solve for pr,2 in terms of the angles of first two partons and pr,i, 

M 2 -2P- Pl 

PT,2 = KTE B1 °) 

2{P-p 2 -pi -p 2 ) 

A general 3-particle template is then completely specified by pr,i and two unit vectors (or, equivalently, four angles) 
pi and p2- 

3. Extension to arbitrary A^ 

A generalization to an arbitrary number of particles is straight-forward. Proceeding as above, the A^-particle 
templates are determined by pi, ■■ ■ ,Pn-i subject to the constraint, 

N-l 

(P-5>) 2 =0. (Bll) 

i=i 

Using pi = pr,iPi, we can now solve for pt,n-i in terms of the p\, ■ ■ ■ ,Pn-2 and Pn-1, 

d _ M* + 2Y^P 1 -P ] -2P-Y^- 2 P, 

Pt,n-x - — — — — — — jv=2 ( B12 ) 

2 \Pn—i ' P) 2 pn—i ' Pi 

For the special cases of N = 2 and N = 3, this formula reduces to the above results . 



4. Numerical simulations 

We choose to cover the phase space uniformly in the (3JV — 4) variables pi, ■ ■ ■ ,pn-i and pr.i, • ■ 'Pt,n-2- For a 
finite number of points this method covers regions of phase space with large Planar Flow much more uniformly than 
a Monte Carlo based approach. 

We survey the kinematically-allowed templates by fixing the total four momentum of each of the analyzed configu- 
rations and scanning over the possible values of ij>T,i) and the angles (pi) within the bounded interval. The number 
of variables depends on the number of degrees of freedom of the template states. The value of P = (px, 0, 0, Ej) can 
be imposed by an additional equation: 

N-l 

PN = P-J2Pi- ( B13 ) 

i=l 

We generate template libraries in the r\ — tj> plane with a sequential scan in steps of Ar] = A(f> — 0.05. Three- 
particle templates we rehire and additional scan over the transverse momentum with which we perform in steps of 
Apt = 5 GeV. The resulting four momenta are a requirement that they "fit" into an anti-kT jet of fixed radius 
R. Templates with particles outside the jet cone are discarded. Our choice of step sizes leads to O(10 4 ) 2-particle 
templates and O(10 6 ) 3-particle templates. 
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